【Quant】support ue8m0 for fp8_quant_blockwise #77153

liuruyan · 2025-12-30T09:24:48Z

PR Category

Operator Mechanism

PR Types

New features

Description

为fp8_quant_blockwise升级支持ue8m0类型scale

paddle-bot · 2025-12-30T18:16:28Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

codecov-commenter · 2026-01-05T09:43:32Z

Codecov Report

❌ Patch coverage is 0% with 19 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (develop@a977b32). Learn more about missing BASE report.

Files with missing lines	Patch %	Lines
paddle/phi/infermeta/unary.cc	0.00%	19 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             develop   #77153   +/-   ##
==========================================
  Coverage           ?    0.00%           
==========================================
  Files              ?        1           
  Lines              ?       19           
  Branches           ?        0           
==========================================
  Hits               ?        0           
  Misses             ?       19           
  Partials           ?        0

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

zyfncg · 2026-01-15T03:07:38Z

paddle/phi/ops/yaml/ops.yaml


 - op: fp8_quant_blockwise
-  args: (Tensor x, float epsilon, bool using_1x128_vec_quant, bool input_transpose, bool output_scale_transpose, bool return_transpose_only, bool using_e5m2, bool using_pow2_scale)
+  args: (Tensor x, float epsilon, bool using_1x128_vec_quant, bool input_transpose, bool output_scale_transpose, bool return_transpose_only, bool using_e5m2, bool using_pow2_scale, bool using_ue8m0_scale)


using_pow2_scale 和 using_ue8m0_scale 之间会有影响吗？

没有影响，using_pow2_scale代表使用2的幂次scale，但是类型仍为float32。using_ue8m0_scale代表不仅仅使用2的幂次scale，并且输出为int32(4个ue8m0)。当两个同时开启时会以using_ue8m0_scale为准。

并且单测中存在笛卡尔积测试样例。两者不会互相冲突

作为正式api的话，这些细节需要在api文档里说明，否则使用者只能通过试运行来确定参数作用

好的，现在是所有attr都没有注释说明，我下一个PR来在python api处为这个算子补充一下完整的注释吧。

support quant ue8m0

f1c5e26

add ut

b6c458c

fix infermeta

c43d27a

liuruyan changed the title ~~support quant ue8m0~~ 【Quant】support ue8m0 for fp8_quant_blockwise Jan 6, 2026

zyfncg previously approved these changes Jan 6, 2026

View reviewed changes

SigureMo previously approved these changes Jan 6, 2026

View reviewed changes

qingqing01 previously approved these changes Jan 8, 2026

View reviewed changes

risemeup1 added the skip-ci: coverage label Jan 12, 2026

Merge branch 'PaddlePaddle:develop' into quant_ue8m0

23271a6

github-actions bot removed the skip-ci: coverage label Jan 12, 2026

risemeup1 added the skip-ci: coverage label Jan 13, 2026

liuruyan added 2 commits January 14, 2026 11:38

fix conflict

b904b9d

update ue8m0

8196010

liuruyan dismissed stale reviews from zyfncg, SigureMo, and qingqing01 via 8196010 January 14, 2026 11:32

github-actions bot removed the skip-ci: coverage label Jan 14, 2026

liuruyan closed this Jan 14, 2026

liuruyan reopened this Jan 14, 2026

SigureMo approved these changes Jan 15, 2026

View reviewed changes

zyfncg reviewed Jan 15, 2026

View reviewed changes

risemeup1 added the skip-ci: coverage label Jan 15, 2026

zyfncg approved these changes Jan 15, 2026

View reviewed changes

qingqing01 approved these changes Jan 15, 2026

View reviewed changes

liuruyan merged commit 09d88f9 into PaddlePaddle:develop Jan 15, 2026
95 of 102 checks passed

liuruyan mentioned this pull request Jan 16, 2026

cp_fp8_quant #77366

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

【Quant】support ue8m0 for fp8_quant_blockwise #77153

【Quant】support ue8m0 for fp8_quant_blockwise #77153

Uh oh!

liuruyan commented Dec 30, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Dec 30, 2025

Uh oh!

codecov-commenter commented Jan 5, 2026 •

edited

Loading

Uh oh!

zyfncg Jan 15, 2026

Uh oh!

liuruyan Jan 15, 2026 •

edited

Loading

Uh oh!

zyfncg Jan 15, 2026

Uh oh!

liuruyan Jan 15, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

【Quant】support ue8m0 for fp8_quant_blockwise #77153

【Quant】support ue8m0 for fp8_quant_blockwise #77153

Uh oh!

Conversation

liuruyan commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

Uh oh!

paddle-bot bot commented Dec 30, 2025

Uh oh!

codecov-commenter commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

zyfncg Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

liuruyan Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zyfncg Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

liuruyan Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

liuruyan commented Dec 30, 2025 •

edited

Loading

codecov-commenter commented Jan 5, 2026 •

edited

Loading

liuruyan Jan 15, 2026 •

edited

Loading

liuruyan Jan 15, 2026 •

edited

Loading